Unit selection for speech synthesi target cos

نویسنده

  • Soufiane Rouibia
چکیده

This paper presents a new approach to unit selection for corpus-based speech synthesis, in which the units are selected according to acoustic criteria. In a learning stage, an acoustic clustering is carried out using context dependent HMM. During synthesis, an acoustic target is generated and segmented in the required diphone sequence. For each diphone to be synthesized, a pre-selection module determines the N-best instances that match this acoustic target. From these candidates, the optimal unit sequence is then obtained by minimizing a concatenation cost through dynamic programming. Objective as well as subjective tests are carried out which shows the relevance of the proposed method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Duration Modeling for Ar Synthesi

Duration modeling is a fundamental task of prosody generation for Text To Speech (TTS) systems. The objective of this task is to predict the duration of a speech unit from its phonological representation. Duration modeling has a significant influence on the intelligibility and the naturalness of the synthesized speech. This paper presents a Neural Network (NN) based approach to predict the dura...

متن کامل

Maximum likelihood unit selection for corpus-based speech synthesis

Corpus-based speech synthesis systems deliver a considerable synthesis quality since the unit selection approaches have been optimized in the last decade. Unit selection attempts to find the best combination of speech unit sequences in an inventory so that the perceptual differences between expected (natural) and synthesized signals are as low as possible. However, mismatches and distortions ar...

متن کامل

Symbolic vs. acoustics-based style control for expressive unit selection

The present paper addresses the issue of flexibility in expressive unit selection speech synthesis by using different style selection techniques. We select units from a mixed-style unit selection database, using either forced style switching, no control, symbolic target cost, or acoustic target cost as a style selection criterion. We assess the effect of selection technique, feature weight and ...

متن کامل

Adaptive manipulation of non-uniform synthesis units using multi-level unit transcription

A synthesis-by-rule system based on the selective use of non-uniform synthesis units has been developed. This system uses a natural speech database and an algorithm which searches the database for the optimal speech segment to be used as the synthesis unit. Because of flexible use of synthesis units, this scheme has great advantages, especially in expressing many coarticulat~ry variations. Howe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005